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Abstract. We explore the question of state estimation for a qubit restricted to the 
x-z plane of the Bloch sphere, with the trine measurement. In our earlier work [H. K. 
Ng and B.-G. Englert, eprint [arXiv: 1202.513 6 [quant-ph] (2012)], similarities between 
quantum tomography and the tomography of a classical die motivated us to apply a 
simple modification of the classical estimator for use in the quantum problem. This 
worked very well. In this article, we adapt a different aspect of the classical estimator 
to the quantum problem. In particular, we investigate the mean estimator, where 
the mean is taken with a weight function identical to that in the classical estimator 
but now with quantum constraints imposed. Among such mean estimators, we choose 
an optimal one with the smallest worst-case error — the minimax mean estimator — 
and compare its performance with that of other estimators. Despite the natural 
generalization of the classical approach, this minimax mean estimator does not work 
as well as one might expect from the analogous performance in the classical problem. 
While it outperforms the often-used maximum-likelihood estimator in having a smaller 
worst-case error, the advantage is not significant enough to justify the more complicated 
procedure required to construct it. The much simpler adapted estimator introduced in 
our earlier work is still more effective. Our previous work emphasized the similarities 
between classical and quantum state estimation; in contrast, this paper highlights how 
intuition gained from classical problems can sometimes fail in the quantum arena. 
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1. Introduction 

Tomography is the process of characterizing a physical system. In the simplest scenario, 
it estimates a single parameter of concern, for example, the transmission probability of 
a beam-splitter used in an optics experiment. In the most general case, tomography 
involves estimating the state of the system, which provides the complete description of 
all properties of the system. Here, we focus on the latter case of state tomography. 

Tomography is an old subject, well explored first in the classical context (see, for 
example, Ref. p]), later in quantum scenarios (see, for example, the classic textbook by 
Helstrom [2]), and is still a very active area of research. For a reasonably recent and 
comprehensive review of developments in quantum tomography, we point the reader to 
the collection of articles found in Ref. [3J. Here, we focus only on a general overview 
of the topic, and provide just sufficient background for the reader to understand the 
context of our current discussion. 

Tomography involves two steps: first, the measurement of many identical copies 
of the system, which requires a choice of measurement; second, the estimation of the 
quantity of interest — be it a single parameter, or the full state — from the gathered 
measurement data, which requires a choice of estimator. The choice of measurement 
can vary from single-copy measurements where one measures one copy at a time, to a 
joint measurement implemented on all available copies of the system at once. We focus 
here on single-copy measurements. In particular, we discuss the simplest case of having 
the same measurement on every copy of the state, as opposed to adaptive strategies 
where the measurement to be performed on subsequent copies is modified according to 
the data collected from previous copies. 

The choice of estimators — mathematically describable as maps from the set of 
possible data to the set of possible states — is equally varied. Estimation theory, as 
discussed by statisticians, explore estimators from the often-used maximum-likelihood 
(ML) estimator, to classes of estimators like Bayes estimators, minimax estimators, etc., 
each motivated by its own philosophy of inference from the data. These are all instances 
of point estimators where one provides a single state (as opposed to a set of states for 
region estimators) as the result of the tomography. 

Estimation theory, originally invented in the context of classical problems, is also 
applicable to the estimation of quantum states. By classical, we simply mean that 
there exists a preferred basis for the system, and all states are described by probabilistic 
mixtures of basis states. In contrast, quantum systems do not possess such a preferred 
basis, and describing quantum states requires not just probabilities, but probability 
amplitudes. This difference between quantum and classical systems complicates the 
issue of state estimation. Taking a frequentist's perspective, the relative frequencies of 
the measurement outcomes computed from a given set of data should approximate the 
probability of getting each outcome for the input state. A good guess for the state will 
thus be one with outcome probabilities equal or close to the obtained relative frequencies. 
For classical systems, every probability distribution corresponds to a physical state of 
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the system. Not so for quantum systems. Outcome probabilities for a physical state 
satisfy a set of constraints dictated by quantum mechanics, but relative frequencies are 
unconstrained. In fact, situations exist where violation of the constraints by treating 
relative frequencies as probabilities is generic rather than unusual. Naively equating 
outcome probabilities to relative frequencies and using this to reconstruct the quantum 
state can thus result in an unphysical estimator. 

Nevertheless, we can usefully think of quantum state estimation as classical state 
estimation with constraints imposed on the probability distributions describing the 
states. One needs to then invent methods of modifying estimators from classical 
estimation theory to enforce these constraints. In |4J, we did this by an ad-hoc "minimal 
correction", by admixing the completely mixed state to the classical estimator, by an 
amount chosen in a minimax way, to render the resulting estimator — which we refer to 
as the corrected minimax estimator — physical. More generally, one can modify the 
estimators by adopting the same inference philosophy as in the analogous classical 
problem, but now incorporating the physicality constraints required by quantum 
mechanics. For example, ML methods prescribe a constrained maximization of the 
likelihood function over the set of physical quantum states (see, for example, [5]), as 
opposed to reporting the (unconstrained) maximum of the likelihood function as the 
estimator, which may happen to be outside of the set of physical states. 

In this work, we adopt the minimax philosophy that leads to a good estimator 
for the classical die problem, and examine the analogous estimator for the problem of 
tomography of a qubit. To restrict to the simplest case with quantum features, we 
consider tomography of a qubit state, with the promise that the state of the system lies 
solely in the two-dimensional x-z plane of the Bloch sphere. Albeit a rather artificial 
promise, one can view this as tomography of a qubit system where we are interested only 
in the information restricted to the x-z plane. For example, this is of practical relevance 
to the four-state BB84 quantum key distribution scheme [6], which uses only a pair of 
conjugate bases of states lying entirely in the x-z plane, and hence only information 
pertaining to that plane is of relevance. We make use of the trine measurement, 
with outcomes as subnormalized projectors onto the trine states — three pure states 
symmetrically arranged in the x-z plane of the Bloch sphere. The trine measurement is 
informationally complete for the x-z plane of the qubit Bloch sphere. 

In [I], we showed how to make use of the similarities of between classical and 
quantum state tomography to construct simple estimators for the quantum problem 
that perform well and inherit desirable properties of the classical estimator. In contrast, 
here we show that, despite the similarities, applying the same philosophy that worked 
well in the classical die problem to our trine problem, the seemingly minor and rather 
natural modification of the classical estimator turns out to work poorly in the quantum 
case. Although this modified estimator still gives better performance (as quantified 
by the mean squared error) than the ML estimator, the additional computation effort 
needed to gain the small advantage seem hardly worth the trouble. This is particularly 
so given that our simple ad hoc procedure in [3] gives significantly better results. While 
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[I] highlighted the similarities between classical and quantum state estimation, here 
we show how simple quantum constraints on the probabilities can greatly complicate 
matters and lead to results noticeably divergent from the classical case. 

In the next section, we begin with the problem of state estimation for a classical 
i^-sided die. We review the notion of a mean estimator, and explain the minimax 
approach to finding the optimal estimator. Using the mean estimator motivated by the 
classical problem, in Section |3j we examine the problem of estimating a qubit state with 
the trine measurement. Again, we follow a minimax approach in choosing the optimal 
mean estimator, and compare its performance with that of the ML estimator and the 
corrected minimax estimator. We conclude in Section HI 

2. The classical K-sided die 

We begin our discussion by considering a classical i^-sided die, for which we are 
interested in finding out the different weighting of the faces of the die. This problem is 
well-known in the classical literature; our review of it here serves to define the notation 
and also motivate the ideas to be used later when studying the quantum problem. 

2.1. The measurement: a die toss 

We toss the die — a measurement — and ask which face of the die turns up. The set of 
outcome probabilities provides a complete description of the die. To phrase this in a 
more formal language suitable for discussing tomography, to each face of the die, we 
ascribe a pure state \k) such that (k\l) = 5m, for k, I = 1, 2, ... , K. The different faces 
of the die thus correspond to orthonormal states, and {\k)}£ =1 is the preferred basis for 
the die. The die is described by a probability distribution {pk}k=n where J2k=iP^ = 1 
and pk > for all k. Each p^ describes the probability that face k turns up when the 
die is tossed. We can also write the state of the die as a positive semi-definite operator 
p with unit trace given by 



A single toss of the die can be described by a probability operator measurement 
(POM), with outcomes Uk = \k) (k\, k = 1, 2, . . . , K. 11% is associated with the outcome 
that the kth face of the die turns up in a toss, and Born's rule gives pu = tr(pUfc) 
as the outcome probabilities for the state p. A die toss is an instance of a symmetric 
measurement, or an "S-POM" (see Appendix A of |f] for more details on S-POMs). 
Every S-POM has, apart from the outcome operators {Il/ C }^ 1 , a set of hermitian, 
trace- 1 operators {Afc}^ with the defining property that trjrifcA/} = 5m- This allows 
the expansion of the part of the state measured by the S-POM as 



K 




(1) 



k=l 



K 




(2) 



k=l 
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As detailed in jl], the operators can be explicitly constructed given the Hfc operators 
for an S-POM. For the case of the classical die toss, Ak — Hk — \k)(k\. Comparing with 
([I]), we see that every state of the die can be written as in §2§. Thus, the die toss is 
also informationally complete (IC), in that it measures all aspects of the information 
pertaining to the die. The die toss is thus an example of a symmetric, informationally 
complete POM, or SIC-POM for short. 

Repeated tosses of the die can be thought of as repeated measurements on multiple, 
identical copies of the die. Measurement on N copies yields data = {ci, c 2 , . . . , cn}, 
where c, = 1,2, ... ,K indicates the outcome obtained in the measurement of the ith 
copy. The data can be summarized as ~ {n 1; n 2 , . . . nx}, where is the number 
of "clicks" in the fcth detector, indicating the number of times outcome k was obtained. 
Note that Y2k=i n k = ^ ' ■ 



2.2. The mean estimator 

Associated with the data ~ {ni, n 2 , . . . , n^} is the likelihood function 

K 

C(D n \ P ) = Y[tF, (3) 

k=l 

which is the probability of obtaining data given a state p. ML methods suggest one 
to use as the estimator, the state that gives the largest likelihood for the data D^. For 
the classical die, this yields the estimator 

Pml(Dn) = ^(fffcWAfc = ^y](pk)ML\k)(k\, with {p k ) M h = (4) 
k k 

Alternatively, one can view the likelihood as a weight over states, and choose as 
our estimator, the weighted average over all states. This gives the mean estimator, well 
known in classical state estimation, 

~ (n x _ S d(j)(p)C(D N \p)p 
J d(f)[p)C[D N \p) 

where d<fr is an integration measure that tells us how to sum over states. d<f) is non- 
negative on physically permissible p, and zero elsewhere. A Bayesian approach to state 
estimation will set d0 as the prior distribution, encompassing all prior information one 
has about the system to be characterized. The mean estimator in this case will then 
simply be the average state of the posterior distribution d4>(p)C(D^\p) (see, for example, 
Ref. [7] for a recent discussion of the Bayesian mean approach to quantum tomography). 
More generally, d<ft is can be thought of as a functional parameter that characterizes the 
class of all mean estimators. 

Parameterizing p by the outcome probabilities p k s via J d(p can be written as 

J d0(p) (...) = J{d P ) X {p)f{p){---), (6) 

where p denotes the list of probabilities {pi,p 2 , ■ ■ ■ ,Pk}, (dp) = dpi . . . dpx, x(p) is 
a characteristic function that accounts for physicality constraints on p, and f(p) is a 
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non-negative weight function that can be adjusted to optimize the performance of the 
estimator with respect to a desired figure-of-merit. The mean estimator can thus be 
written as 

PMe{D N ) = > (PfcjMEAfe, with {p k )ME = r00f , v , , rlr . | v , (7) 

where the lower integration limit of ensures pk > for all k. For the classical die, 

x(p) = «(i-E»)' (8) 

fc 

where the delta function enforces that the probabilities sum to 1. This restricts the 
integration to over physical states of the classical die only. 

The invariance of the physical properties of the die under the interchange of the 
(arbitrarily assigned) labels k for the faces suggests consideration of an f(p) that is 
unchanged under a permutation of the labels. The form of the likelihood function 
further hints at an f{p) given by 

f(p) = [UPkJ • (9) 

Existence of the integrals defining the mean estimator for the classical die requires (3 > 0. 
For this choice of f(p), it is convenient to define moments 

M^(n 1 ,n 2 ,n 3 ) = 2 /"(dp) sfo-^Pk) II^" 1 ' ( 10 ) 
Jo V k k 

where the normalization factor of 2 is chosen for convenience of the quantum problem 
to be discussed later. The mean estimator, for a given value of (3, can then be written 
as a ratio of two moments, 

C \W) _ M /3( n i> • • • ; n k-i, n k + 1, w fc+ i, ■ ■ ■ , n K ) 
Mp{n u . . . , rifc-i, n k , n k+1 , ...,n K ) 

The moments for the classical die can be evaluated explicitly, 

M (n n n \ 2 T(m + /3) T(n 2 + g) ... T(n K + P) 

M,( ni ,n 2 , ...,n K ) = T (N + K(fi-1)) ' (12) 

where T(z) is the familiar Gamma function. Since F(z + 1) = zT(z), we thus have 

( . n k + P n „s 

{Pk>ME - jy + K p K 16 ) 

which can be rewritten as 

(rU (/3) - 1 n W + nk h W with n {l3) ~ 1 fln H ft (/3) - 1 fl/fl 
IWME - J7 a N ' 7W 7 N i Wlt11 a 7V = , N_ allCl °JV = V~W' ' ' 

1 + /3K 1 + at 
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2.3. The minimax estimator 

How should we choose the value of the parameter /3? We make use of a minimax 
approach: is chosen to minimize the worst-case (over all physical states) mean squared 
error (MSE), defined for state p with outcome probabilities p and estimator p with 
outcome probabilities p as 

MSE(p, p) = C(D N \p) \Pk - Pk(D N )f. (15) 

D N k 

While an arbitrary choice for quantifying the estimation error for using estimator p, 
the MSE is used here since it is particularly amenable to analytical manipulations. It 
is an often-used measure of estimation error in classical problems. More generally, 
tomography with a SIC-POM gives an MSE that is equal, apart from an overall 
irrelevant constant factor, to the mean squared Hilbert- Schmidt distance between p 
and p: MSE(p,p) oc J2d n £(Dn\p) tr {{p — p) 2 }- This relation holds even for quantum 
tomography. The average over measurement data is natural from the point of view of 
choosing a single estimation strategy that works for many runs of tomography of the 
same state, each of which can yield a different D^. 



For the mean estimator for the classical die given in (14), the MSE is given by 

MSE(p, pgj> ) = {N + ^ )2 [((3 2 K 2 - N)p 2 + (N- (3 2 K)} , (16) 

where p 2 = J2kPk- Noting that 4 < p 2 < 1 for any state of the classical die, we can 
consider the maximum (over all p) of the MSE for three different cases: (i) (3 2 K 2 > N, 
(ii) (3 2 K 2 = N, and (hi) (3 2 K 2 < N . The minimum (over /3) of the maximum MSE is 
attained for case (ii), where 



= V' (17) 

yielding the minimax mean estimator for the classical die, 

1 rifc, 1 1 
(Pfc)MM = -f7a N + —b N , with a N = = and b N ~ 1 



K JV 'N JV ' iV "l + ViV 1 y ~l + l/VN' 



Actually, the estimator in ( 18 ) is minimax not just over the class of mean estimators 
parameterized by /3, but is minimax over all estimators for the classical die problem. 
To see this, we observe that f3 = y/N / K results in an MSE that is constant over all 
states. It is known from estimation theory that a mean estimator with constant MSE 
is minimax, that is, it has the smallest worst-case MSE, over all estimators (see, for 
example, [TJ; or see [1] for a self-contained proof of the fact). This thus provides an 
objective justification for choosing a weight function f{jp) of the form (J9|. 



3. The qubit confined to a plane 

We now turn to the quantum problem of a qubit confined to the x-z plane of the Bloch 
sphere and explain how to adopt the same philosophy that led to the minimax estimator 
for the classical die in the quantum problem. 
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3.1. The trine measurement and physicality constraints 

As stated in the introduction, the trine POM is an S-POM with three POM outcomes 
built from three pure states symmetrically arranged in the x-z plane of the Bloch sphere, 
subtending angles of ^ between pairs of states. These states are collectively known as 
the trine states, 

1 2tt 
MM = 2 ( X + a zCos(p k + a x sin4> k ) , with 4> k = (p + (k - 1) — , (19) 

for k — 1,2, 3, where cr^s are the usual Pauli operators for describing two-dimensional 
systems. Here, 0o is a fixed angle that determines the orientation of the trine states in 
the x-z plane. The trine states are linearly independent and complete since 

3 

\{MTl>i)? = \&ki + \ and IX^lVfcXV'fcl = 1- (20) 

fc=i 

The outcomes of the trine POM are subnormalized projectors onto the trine states, 

II fc =|V*)!<V*|, ^ = 1,2,3. (21) 



As is required for a physical POM, (20) ensures ^ fc=1 hi = 1. Outcome probabilities 



for state p using the trine POM are given by 

p fc = trOt)n fc ) = -[l + rcos(0-0fc)], fc = l,2,3, (22) 

where we have used the fact that every physical state lying in the x-z plane of the Bloch 
sphere can be described in polar coordinates as 

p = - [1 + r[a z cos <p + o x sin <p)} , with < r < 1 and < <f> < 2n. (23) 

The Afc operators for the trine measurement are 

A k = |^fc)2(^fc| - ^ = ^ + o- z cos(p k + a x sin0 fc , k = 1,2,3, (24) 

and a state with trine outcome probabilities pk can be written as p = YlkPk-^-k, as 
previously explained. Every qubit state in the x-z plane of the Bloch sphere can be 
described this way, with a unique set of outcome probabilities, demonstrating that the 
trine measurement is informationally complete for this restricted set of qubit states. 



From (22), the trine outcome probabilities satisfy 



i< P ^£rf4( i+ H4 (25) 

for all physical qubit states. Here, we have used the identities sin 4>k cos 0^ = 
and ^ fc (cos(/>fc) 2 = ^ fc (sin0 fe ) 2 = |. Notice the difference here from the classical die 
problem. For a 3-sided classical die, the outcome probabilities satisfy no additional 
constraint apart from those that ensure they form a probability distribution (that is, 
J2kPk = 1 an d Pk > for all k, which also guarantee p 2 > |), and p 2 can be as large 
as 1. We can visualize the physical states of the classical 3-sided die as points on an 
equilateral triangle (the planar region defined by Y2kPk = 1 an d Pk > for all k in 
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Figure 1. The triangle contains all points (pi,P2)P3) such that J^kPk = an( ipfc > 
for all k. Every point in (or on) the triangle corresponds to a physically permissible 
state of the classical 3-sided die. The disk contains all points in the triangle that also 
satisfy the quantum constraint of ^2 k p1 < \- Every point in the disk represents a 
physically permissible state of the qubit, while points in the triangle outside the disk 
are unphysical. 



the P1-P2-P3 space), with vertices corresponding to the states with outcome probabilities 
ipx,P2iPz) — (1) 0, 0), (0, 1, 0) and (0,0,1) (see Figure [l]). Physical qubit states, with 
associated trine outcome probabilities, however, do not occupy the entire triangle. For 
example, the point (1,0,0) does not correspond to a qubit state, since the outcome 



probabilities violate constraint (25). Instead, physical qubit states reside on the disk 
inscribed within the classical equilateral triangle (the intersection of the equilateral 
triangle with the ball of radius ^= in the p\-pi-pz space). Points in the triangle outside 
of the disk correspond to states with at least one negative eigenvalue, and are hence not 
permissible qubit states. 

An estimator for the classical 3-sided die problem will report a point in the triangle. 
Estimators for the qubit with the trine measurement will, however, need to land only 
inside the disk. The ML procedure instructs us to look for the maximum of the likelihood 
function, for given data, constrained to the disk. Our procedure in [I] tells us to start 
with the estimator for the 3-sided die problem, and if it lies outside of the disk, to "pull 
it in" towards the centre until it lies on the boundary of the disk (or better yet, just 
inside the boundary — see [I] for more details). Below, we examine yet another approach 
to dealing with the constraint to the disk. 

3.2. The mean estimator for the trine 

Recall that the integration measure in the mean estimator (see ([6])) includes the 
characteristic function x(p)i which accounts for all physicality constraints. A natural 
way, then, to impose the constraint of p 2 < | for the trine in the mean estimator, is to 
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include it explicitly in x(p)i 



3 



x(p)^ i-Ep* ^U-jyiy ( 26 ) 

Here, r]( ) is Heaviside's step function: rj(x) = for x < and rj(x) = 1 for x > 0. 
As in the classical die problem, the delta function ensures YlkPk = 1j while > is 
enforced by the lower limit of integration in the mean estimator. Any mean estimator, 
with the above choice of x{p)i wm automatically be a convex sum (integral) only of 
physical qubit states (confined to the x-z plane of the Bloch sphere), and will hence be 
itself a physical state, or equivalently, will lie in the disk. 

We are still left with the choice of the weight function f(p). Apart from the 
additional constraint on p 2 , the trine problem has identical symmetries as the 3-sided 
die problem. This suggests consideration of the same f(p) function that worked well for 
the classical die (see (pi)), 

f(p) = [UpA ■ (27) 

(3 can be any real number as long as all the integrals in the mean estimator (for all data) 
exist. For the classical die, we needed (3 > 0; for the trine, the additional step function 
in x extends the range to (3 > — |. 

As in the case of the classical die, we can define moments for the trine problem, 

Mp( ni ,n 2 ,n 3 ) = 2^°°(dp) s(l - J^P^j ^(I'P 2 ) II ( 2 §) 
The normalization factor of 2 is chosen such that Mi (0,0,0) = 1. The trine moments 



look identical to those for the classical die (Eq. (10)), except for the additional step 
function, which contains the only visible quantum-mechanical feature of the problem. 

The seemingly harmless addition of the step function results in an integral that 
is difficult to do explicitly. Instead, we begin with Mi (0,0,0) = 1, and obtain 
the exact values of Mp(rii, n 2 , n 3 ) for positive integer values of (3 with the aid of 
recurrence relations. The moments for non-integer values of (3 > 1 are obtained by 
interpolating between adjacent integer-/? moments. The moments for f3 between 
and 1 are numerically computed by first performing numerical integration to obtain 
Mo(rai, ri2, n^) and then interpolating with Mi(ni, ri2, ns) to obtain the remaining non- 
integer (3 values. The moment values for (3 between — | and 0, because of the approach 
to the singularity at (3 = — |, require more care to compute numerically. However, since 
negative (3 values turn out to not be needed except for values of N(< 30) so small as to 
be irrelevant to tomography, we will focus only on computing the moments for (3 > 0. 
These steps are sufficient for us to assess the efficacy of the mean estimator, compute 
the minimax estimator over the allowed (3 values, and compare the performance to other 
estimators. 

The integral defining the moments can be simplified by using polar coordinates to 
parameterize the domain space, thereby getting rid of the delta function and the step 



Minimax mean estimator for the trine 



11 



function. We write the probabilities p k s in terms of the polar coordinates r, as in (22) 
Without loss of generality, we can choose the trine orientation such that 0o = 0. Then 



M f} (n 1 ,n 2 ,n 3 ) 



27 Jo 



Y[[l + rcos((f)-(j) k )] 



n k +/3-l 



(29) 



The moments have permutation symmetry, 

M p (k, I, m) = M p (l, m, k) = M p {m, k, I) = M p (l, k, m) = M p (m, I, k) = M p {k, m, I), (30) 
and obey a sum rule, 

Mp(ni + 1,712,713) + M^(n 1 ,n 2 + l,n 3 ) + Mp(n 1 ,n 2 ,n 3 + 1) = Mp{n u n 2 , n 3 ), (31) 
since p\ + p 2 +Pz — 1. Note also a useful identity, 



P1P2P3 



1 - -r z 
4 



4 



' cos(30) 



(32) 



Recurrence relations for integer values of /3 > 1 

We begin with the moment Mi(0, 0, 0) = 1 and note that 

Mp(m,n 2 ,n 3 ) = M 1 (n 1 +/?-l,n 2 + /?-l,n 3 + /?-l). 



(33) 



Hence, all moments for positive integer values of /3 can be obtained from the moments 
Mi(ni, ri2, 7i 3 ) for /3 = 1 with integers ni,n 2 ,n 3 > 0. For notational simplicity, we drop 
the subscript '1' whenever we are discussing moments for j3 = 1, 



d0 



M(n 1 ,n 2 ,n 3 ) = Mi^n,,^) = 2 dr r pTpTpT- (34) 

JO J(2tt) ^ 

The goal here is to develop recurrence relations that connect M(ni,n 2 ,n 3 ) to 
moments with smaller values of n k s. Observe that differentiating the p^. k in the integrand 
of M(ni, n 2 , n 3 ) will decrease its power from n k to n k — 1, taking a step towards our 
goal. To implement this differentiation, we make use of the two-dimensional gradient 
operator, 

d „ d ^ d . 1 9 



V = e x — + e z 



ox oz or r 



2ty 



m no n^t 

Pi P2 P3 



r=l 



(35) 



(36) 



and introduce the surface moment, 

L(n 1 ,n 2 ,n 3 ) = dr r — V ■ {rp^p^p^ 3 ) = 

JO J(2w) Z7V J(2n) 

where r = xe^ + ze z = re r . In the second equality, we have used Gauss's theorem 
to convert the integral over the disk into an integral over its circumference (r = 1). 
Performing the divergence operation in the integrand of L, we obtain the following 
relation, 



(N + 2)M(m, n 2 ,n 3 ) - 2L(m, n 2 , n 3 ) 

- n x M{ni - 1, n 2 , n 3 ) + n 2 M(n 1} n 2 - 1, n 3 ) + n 3 M(n 1} n 2 , n 3 - 1) 



(37) 
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The left side of the equation involves moments with total number of clicks N = 
n i + n 2 + n 3 ; the right side involves moments with N — 1 clicks. Given the moments for 
N — 1 clicks, we can use this recurrence relation to compute the moments for iV clicks, 
provided we can also compute the L moments for N clicks easily. 

To obtain recurrence relations for the surface moments, we again try to differentiate 
the pkS, this time with respect to <p. Specifically, we begin with the identity 
dd> d 



(27T) 



2tt 



sin i 



n-\ no ri'i 
PlP2P3 



0. 



(38 



Carrying out the <fi derivative in the integrand above and re-expressing the result in 
terms of the L moments, we obtain the relation 

3(N + l)L(m + l,n 2 ,n 3 ) 
= (N + 1 + ra 1 )L(ni, n 2 , n 3 ) + n 2 L(n 1 + 1, n 2 - 1, n 3 ) + n 3 L(ra 1 + 1, n 2 , n 3 - 1) 



-n 2 L{n 1 ,n 2 - l,n 3 ) - -n 3 L(ni,n 2 ,n 3 - 1). 



(39) 



2~'-^~"~' 1 °> 2 
The first line of the equation involves N + 1 clicks, the second line involves iV clicks, 
and the last line N — 1 clicks. Permutation symmetry yields similar recurrence relations 
for L(ni, n 2 + 1, n 3 ) and L(n\, n 2 , n 3 + 1) in terms of moments involving fewer clicks. 

The recurrence relations (37) and (39 ), together with the initial values M(0, 0, 0) = 
L(0,0,0) = 1, generate exactly all moments Mp{n\,n 2 ,n 3 ) for non-negative integers 
nx,n 2 and n 3 , and (3 taking integer values > 1. One can now put these recurrence 
relations into a computer and efficiently compute, to a desired precision, the value of 
any moment with integer > 1. For even better numerical accuracy and speed, one 
can also make use of additional formulas for special cases of the moments, such as 



M(n, 0,0) 
L{n, 0,0) 
L(n, 1,0) 
L(n,2,0) 
L(n, n, n) 



ni 



(n + 2)! 
(2n)! 



fe=i 



(A;!) 2 



+ 



[n 

n+l 



(2n)! n + 2 
n + l' 



(n!) 2 
n+2 (2n)! 
(n!) 2 



2(4n + 5) 
(n + l)(n + 2) 



1 \ n (2n-l)! 
216/ (n - 1)! n! 



(40) 



Moments for /3 = and non-integer /3 

The moments Mo(ni, n 2 , n 3 ) are computed approximately by direct numerical 
integration. One can verify the accuracy of the numerical integration routine by 
comparing the results to exact values like those for integer 0, or to the exact value 

M o (0, 0, 0) = 7rl2V3, (41) 
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for which the integral can be done analytically. 

For non-integer values of ft > 0, we interpolate between the two nearest integer ft 
values. Numerically, we find that the logarithm of Mp, for fixed ni, n 2 , n 3 values, is well- 
approximated by a linear function of ft. An exponential interpolation, or equivalently, 
a linear interpolation of the logarithm of Mp, hence works well, 

Mp(n 1 ,n 2 ,n 3 ) _ / M [/3J+ i(ni, n 2 , n 3 ) \ P ~ m 
Mipj(ni,n 2 ,n 3 ) V ^L-8j( n i' n 2,n 3 ) / 

where |_ /3 J denotes the largest integer < ft. 

For ni,n 2 , n 3 ~ N ^> 1, we can understand this exponential behaviour. When n^s 
are large, the integrand of Mp, viewed as a function over the disk in Figure [TJ is sharply 
peaked about the centre of the disk. This means that it makes little difference whether 
we integrate only over physically allowed p^s (those in the disk), or over all p^s in the 
entire triangle. This allows us to approximate Mp(n\, n 2 , 713) for the trine problem by 
that of the classical 3-sided die problem, or equivalently, to ignore the step function 



in x(p)- Then, using the explicit formula for the classical moments given in (12) and 
invoking Stirling's formula n\ ~ \/27rn n+1 ^ 2 e~ n to approximate the Gamma functions, 
we obtain 

Mp(m,n 2 ,n 3 ) _ fni 712713 ^ 



M m {m,n 2 ,n 3 ) \NNNJ 



exactly of the form of (42). Numerically, we find that this exponential interpolation 
works well even for small values. 

These recurrence relations and interpolations allow us to compute, as a ratio of 



moments (see (11)), the mean estimator p^ E for the trine for any ft > and any 
data Dn ~ {ni, 712,713}. For integer ft > 1, the moments are exact, and the resulting 
estimated probabilities (p/c)me are a ^ so exac t- F° r ft — 0, the estimated probabilities 
are accurate up to the precision of the numerical integration used to compute those 
moments. For non-integer values of ft where interpolation is done, the moments, 
and likewise the estimated probabilities, are only approximate. In particular, because 
the exponential interpolation used for non-integer ft does not respect the constraint 
J2kPk = 1) we en d up with estimated probabilities that violate this constraint, but only 
by a small amount (typically less than 3 percent). A simple remedy is to normalize the 
estimated probabilities, and we verify numerically that these normalized probabilities 
also satisfy the quantum constraint of p 2 < \. 

3.3. Minimax mean estimator for the trine 

What remains is to choose the optimal value of ft to use. As in the classical die problem, 
we choose ft by a minimax approach, and define the optimal ft as that which attains 
the minimax MSE, 

nun max MSE^p, p^e) • ( 44 ) 
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Unlike the classical die, we can only perform this optimization numerically. The optimal 
value, as well as the resulting minimax MSE, are reported in Figures [2] and [3] 
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Figure 2. Plot of the optimal /? values for the minimax mean estimator applied 
to the trine problem versus N (solid black line with circular markers). Shown on the 
same plot for comparison are the optimal (3 values for the minimax estimator for the 
analogous 3-sided die problem (see (17)) (dashed red line). Inset: The solid blue line 
gives the MSE as a function of (3 for N = 100; the dashed black line indicates the value 
of MSE that is one percent more than the minimum MSE value. 



In Figure |2j we give the optimal value as a function of N, the total number 
of copies measured. The optimal starts close to zero for N = 30 and increases 
monotonically as N increases, in a manner suggestive of a viV behaviour. In the same 
figure, we have also plotted the optimal value of \/N /3 for the 3-sided die problem. 
The shapes of the two curves are qualitatively similar. 

One can understand qualitatively the offset in the values between the classical 
and the quantum problems. For the classical problem, for > 1, f(p) is a function 
that is large near the centre of the triangle in Figure [TJ and small near the boundary. 
The larger the value of 0, the smaller the weight assigned to boundary states compared 
to states near the centre. For the trine problem, f(p) has a similar behaviour, but the 
states outside the disk are given zero weight because of the step function in x- This 
step function can be thought of, heuristically, as making the overall weight more peaked 
about the centre, as if f(p) has a larger value of 0. This reasoning agrees with the 
observation that the optimal for the trine problem is smaller than that for the same 
iV for the classical 3-sided die. 
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One should note that the minimum over (3 for a given N is not a sharp one. As 
illustrated in the inset of Figure [2j the value of the MSE for N = 100 varies only by 
about one percent when we move about 0.2 away from the optimal (3. This behaviour 
is also observed for other values of N. This means that, whether we take (3 equal to the 
optimal value as given in Figure [2j or ±0.2 of the optimal value, the performance of the 
mean estimator does not change significantly. Of course, if one moves too far from the 
optimal value of /3, we see a marked increase in the MSE, but around the optimal point, 
the exact value of (3 does not matter much. After all, it is meaningless to compute the 
estimator to a precision beyond that justified by the data, which, in practice, will be 
polluted by some level of noise. 
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Figure 3. We plot the maximum and minimum MSE (over all states) for three 
different estimators: (1) the minimax mean estimator (using the optimal /3 value for 
each N) (solid block line with circular markers), (2) the ML estimator for the trine 
measurement (solid blue line), and (3) the minimax estimator from [4] (dashed red 
line). 

Figure [3] gives the maximum and minimum MSE for the minimax mean estimator 
(with the optimal (3 value for each N). Comparing these with the MSE for the ML 
estimator for the trine measurement (solid blue line) [8], we see that the performance 
of our minimax mean estimator is slightly better in terms of slightly smaller maximum 
MSE, but the minimum error is slightly higher. 

We have also plotted the MSE for the corrected minimax estimator from [I] for the 
trine measurement. For this estimator, the maximum MSE is noticeably smaller than 
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either the ML estimator or the minimax mean estimator. The minimum MSE is also 
significantly higher than either, but this actually gives the corrected minimax estimator 
the nice feature that the MSE is nearly constant over all states, as indicated by the close 
values of the maximum and minimum MSE. Given that we typically have little or no 
prior information about the state we are given, a constant MSE provides an objective 
way of treating every state in a fair manner. 

The comparisons with the ML estimator and the corrected minimax estimator 
suggest that the minimax mean estimator discussed in this paper does not provide much 
advantage. If one is concerned with having a smaller minimum MSE (for example, if one 
knows that the states that attain this minimum value are more likely to occur), and does 
not mind the slightly higher maximum error, then, one would use the ML estimator. If 
one prefers a lower worst-case performance, or require a fair treatment of every state, 
one would use the corrected minimax estimator. The minimax mean estimator, despite 
its natural generalization from the minimax estimator for the analogous classical die 
problem, does not perform quite well enough to justify the complicated procedure 
required to compute it. 

An alternate way to generalize the minimax estimator for the classical die problem 
to the quantum case is to consider a different form of the weight function /. In our 
analysis above, because of the mathematical similarities with the classical die when 
viewed in terms of the probabilities pk, it was natural to utilize exactly the same f(p) 
as in the classical problem. Another possibility is to choose 

f{p) = [det(p)f-\ (45) 

where p is considered as a function of the pi~s. For the classical die, where every state 
p can be expanded in terms of the basis that describes the measurement (see Q), the 
determinant of p is nothing but a product of the pkS, giving the equivalence between the 
choice of / either as in & or in (45). For the quantum case, the two choices differ. (45) 
might also be a plausible choice of / since it was previously used in [9], in the context 
of hedged maximum likelihood, as an additional weight function to generalize the same 
classical estimator in (13) to the quantum regime. With this alternate choice of /, one 
can employ the same techniques as described above to construct the minimax mean 
estimator for this weight function. Our preliminary investigations into this, however, 
indicate that the performance (in terms of the MSE) of the resulting estimator is very 
similar to that of the minimax mean estimator described before and the alternate choice 
of weight function offers no advantage over our previous choice. 

In fact, considering an f(p) that is a product of the pkS, as we have done above, 
is more attractive than the choice in (45). This is because the product of PkS can be 
thought of as explicitly incorporating information about our choice of the tomographic 
measurement. This knowledge about the measurement used, from a perspective of 
interpreting the weight function as encompassing one's prior information, should enter 
the construction of the estimator. On the other hand, det(p) does not single out any 
particular measurement — one would write down the same function regardless of the 
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measurement used — and it depends on the pkS only implicitly through p. In any case, 
either choice gives similar MSE values that do not quite outstrip the performance of 
previously known, simpler estimators. 

4. Conclusions 

Motivated by the classical die problem, we derived the minimax mean estimator for 
the tomography of a qubit restricted to the x-z plane of the Bloch sphere with the 
data acquired by a trine measurement. The trine problem has many similarities to the 
classical 3-sided die problem, with a single key difference being an additional physicality 
constraint imposed by quantum mechanics. The similarities invite us to apply the same 
minimax approach used in the classical problem to look for a good estimator for the 
quantum problem. That this would be a good idea is reinforced by our earlier work in 
[I] where a simple ad-hoc adaptation of the classical estimator to the quantum problem 
worked very well. The mean estimator used for the classical problem also provides a very 
natural and elegant framework for incorporating the quantum constraint. Nevertheless, 
we find, somewhat surprisingly, that the resulting minimax mean estimator does not offer 
much advantage over simpler estimators like the ML estimator or the corrected minimax 
estimator. It yields slightly better worst-case performance than the ML estimator, but 
the small gain does not warrant the additional complications required to compute it. 
This is a reminder to us how much more complex the quantum world can be, and how 
intuition from classical problems can sometimes fail in translation to the quantum case. 

An important step forward will be to explore higher dimensions and other choices 
of tomographic measurements. As the dimension of the system grows, the numerical 
complexity will undoubtedly increase. Nevertheless, it is pertinent for us to question 
if the above conclusions hold in higher dimensions, since the qubit is often a rather 
special case. Another possible future direction is to study the behaviour of the minimax 
mean estimator for a different figure-of-merit than the MSE. While the MSE is a 
convenient and often-used choice of estimation error, there are certainly scenarios for 
which alternative measures of assessing the performance of the estimation strategy (for 
example, the mean trace distance or relative entropy) will be more appropriate. 
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